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ABSTRACT 

This study deals with the distribution of mortality 
risks in Canada in the period 1975-77. The description is based 
on nonparametric estimates of probability densities of risks of 
death in five year age groups. These estimates are derived from 
small area, age specific mortality data after adjustment for 
sampling variability. 

Most variability in the mortality data represented 
variability in risks rather than instability in estimation from 
small populations. The variability in risks is represented in 
quantile life tables. These are collections of life tables, each 
of which is based on risk estimates at a fixed quantile of age 
specific risk distributions. 

Summary measures of the distribution of risks are 
provided by estimates of quantiles of expectation of life at 
birth and of median age at death. Interquartile ranges for life 
expectancy are estimated to be 3.2 years for males and 2.8 years 


for females. 
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1. Introduction 

Phasiwistudyie ismeconcerned awithse, theswestatistical 
description of variability among small area, age specific 
probabilitless of sdeathiud(%)imenin- the. age .interval ¢.[x,x+tn) , 
conditional on survival to age x. In most cases, n equals five. 
The data, which are averages of annual data over the 1975-77 time 
period, comprise estimates nd (X) for both sexes in each of 259 
geographic areas in Canada. The geographic units are census 
adivisions .(CDs)* which. correspond. .to . counties or other 
subprovincial administrative units. The exact definition differs 
among provinces. 

The 259 CDs represent all regions in Canada with the 
exception of the Yukon and Northwest Territories and the most 
northerly part of Quebec (Nouveau-Quebec). Although the 
populations of many of the CDs are large in statisticians’ terms, 
they are generally smaller than demographers usually consider 
adequate for construction of life tables. Demographers often 
assume that populations of 100,000 or less provide unstable 
estimates of mortality probabilities. 

The study has two features which are not common in life 
table analysis. First, estimates of ,q(x) are assumed to be 
Sib) cote t OmeVvanvabilit Vee wnichw. js) sanalogous «to. sampling 
variability. Second, the observations are assumed to come from a 
population which has a heterogeneous distribution of mortality 


risks. Heterogeneity refers to differences in risk levels and 


arises from differences among individuals (e.g., diet, life 
style, etc.). 

Bach= "CD "as" treated Yas**a™“sample™= of the sscanadian 
population “which® Ws selectéd@m™without regards to Mmontalley 
conditions. Consequently, each nd (x) is an average taken over 
individual s*=whose smoutali ty erisks @*aLre"s assumed =to gabe 
heterogeneous. The sampling variance of nd (x) is determined by 
variance of unobserved risks within CDs and by CD population 
size. There are marked differences in the sizes of CD 
populations. According to the 1976 census, the population totals 
of CDs in the ‘study ranged from 1500) to. 2.4e=mel Pron. (nus ea 
description of the distribution of risk involves simultaneous use 
of q(x) estimates which are not equally reliable. 

The study is directed towards more fully describing 
mortality conditions within national populations. Within each age 
interval, nonparametric estimates of probability density 
functions of mortality risk are calculated. These estimates 
provide a flexible means of assessing inequality in mortality 
risks (e.g., by quantile based dispersion measures). 

As an outcome of density estimation, period life tables 
are constructed by linking the estimates of age specific risk 
that correspond to a given quantile. These are termed quantile 
life tables. This approach differs from the more conventional 
linkage of estimates of age specific risk corresponding to a 
geographically defined population. The quantile life tables are 


calculated for grouped ages (i.e., forming abridged quantile life 
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tables). Abridged quantile life tables serve to illustrate an 
improved description of mortality for a heterogeneous national 
population. For purposes other than illustration, quantile’ life 
tables by single years of age would be preferred. 

The emphasis in this article is on quantile life 
tables. A justification for basing them on CD data requires that 
allowance be made for differences in reliability among nd (X) « 
That is, the life tables should reflect heterogeneity in 
mortality, rather than the instability of small area estimates. A 
further requirement is that the variability among CD means be 
random, so that it is valid to describe the distribution of risks 
by a probability density. The two sections immediately following 
address these aspects of the estimation problem. Section 2 
Seacl Desisar lemy.d(x)mmecstimatOlrmmandgg dns approximation of gsits 
variance. In section 3, it is argued that geographic grouping of 
the data can provide a basis for estimating the distribution of 
risks in the Canadian population as a whole. 

EScimation Oi chemaiscributionssofemwrisks’ is7eing.two 
stages: (1) section 4 describes adjustment of observations for 
sampling variability by a linear empirical Bayes procedure, and 
(2) smooth nonparametric estimates of the probability density of 
risks, by age, are obtained in section 5. Section 6 provides 
selected illustrative results for two male age groups. Section 7 
presents results derived from selected quantile life tables. 


Finally, concluding remarks are presented in section 8. 


2. Sampling Variability 


Ascribing ‘sampling’ variance to estimates of mortality 
probabilities may be controversial, if death registration is 
complete and census undercoverage is negligible. In that case, 
mortality rates might be regarded as exact values. This view 
makes no allowance for nonrepeating and essentially unpredictable 
factors. Iinftluencings a mspopulation ss moctal 1 Cy 1nedusg ivensmcimc 
period (e.g., extremes of weather, etc.). Brillinger (1986) 
provides a rigorous basis for describing ‘natural variability’, 
and applies one model to the mortality of the Canadian female 
population, 1926-82. A variety of alternative models are 
presented in the discussants’ comments following Brillinger’s 
paper. 

Observed values nd (X) 5 for the j’th CD are represented 


following Chiang (1972) as (using actuarial notation): 


n9(X)5 = nD(x)4/ ( £ P(x) + (n-na(x)) pDOX)3 1 7 1) 
= nD(x)5/K(X) 5, (1) 

where, nD(X) 5 are average annual deaths in the interval 1975-77 
to those aged x to x+n, nP(X) 5 is the 1976 census population, and 
n@2(xX) represents average years lived in the age interval by a 
person who died in the interval. The denominator in Eqn. 1 is an 
estimate of the annual population exposed to risk. In the 
remainder of the paper, K(X) 5 will be treated as if it were the 
true value. 


The counts of deaths by age and CD were obtained from 


Special tabulations provided by Health Division, Statistics 


3) 
Canada. Population counts were obtained from the 1976 Census 
database. Note that population counts have been randomly rounded 
COSmmMULCIpLeSo, Ofsmetive sas required atone public suse: “census 
tabulations. 

POrsegemdtOUps | OF iviratide | ly > )ter a(x )/ewasmueriveds from 
abridged male and female life tables for Canada, 1975-77 (Nagnur, 
1986). For all other age groups, ,a(x) was assumed to take the 
value 2.5 (i.e., assuming a uniform distribution of deaths). The 
Paucers: represents® altifirst S-approximation. Other, more 
sophisticated, approximations are available (see, for example, 
Keyfitz and Frauenthal (1975)), but the appropriateness of these 
methods applied to small populations that are subject to 
substantial migration can not be readily evaluated. Consequently, 
only a first approximation is employed in this study, a strategy 
advocated by Hoem and Jensen (1982) and others. 

Hereafter the subscript n, denoting the width of the 
age interval, is dropped and notation representing dependence on 
age is eliminated to simplify expressions. All subsequent 
analysis applies independently to each age interval. 

A statistical model of the variability of qs can be 
expressed in the following terms. Let Qi; be a Bernoulli random 
variable representing possible mortality outcomes for the i’th 
individual in the j’th CD in the appropriate age interval. Q35 
takes the value 1 (representing a death) with probability qj and 
the value 0 with probability 1-qi3- Thus, qi; represents a 


personal risk of death. 
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The expected count of deaths in the j’th CD is given by 
the sum (over i) of personal risks. The observed count (1i.e., D5) 
corresponds to the sum of realizations qi5 (taking value 1 if the 
person died and 0 otherwise) of the random variable Qj4, so that 
Eqn. 1 provides an estimator of average risk. The true variance 
of the observed count (assuming independent risks) is given by: 
Var(Dj) = z qij (1 - ais), (2a) 
which may be rewritten as: 

Var(Dj) = Ky q-3 (1-q-5) - é (ai; = Cig) (2b) 
where q-5 is the average of personal risks. Eqn. 2b shows that a 
binomial variance of Ds gives the maximum variance among all 
distributions of Kj personal risks with the average q-4- The 
accuracy of the binomial approximation may be improved by 
reducing the width of age intervals, as long as K; remains large. 
The result depends on all qij becoming small in the limit. Note, 
however, that the assumption of independent risks may be too 
strong. 

The remainder of this paper will concern itself with 
subsequent stages of analysis, under the assumption that the 
binomial approximation is adequate. Then, the approximation to 
the sampling variance of qs is q3 (1-45) /K3- Note that differences 
among Ky (i.e., sample size) contribute to differences in 
reliability among a5, and that is the basis for describing the 


variance component as ’sampling variance’. 


J 
3. Interpreting CD Mortality Differentials 

The construction of regional life tables and the 
presentation of mortality data in atlases can give the impression 
Chate LU @iSmeappropriates, tom treat. geographically » distinct 
POpUlLaAcLONSs scSePleeetneyeaweres Closed. tow migration |.ol,.as), if 
mortality differentials somehow depend directly on region. But, 
Since the geographic boundaries of CDs are determined without 
reference to mortality conditions, differences among q5 qdomnot 
represent systematic differences among CD populations. 

Even where some mortality risk factors are clearly 
associated with geography, it does not necessarily follow that 
geography provides an important basis for stratifying the 
population at risk (i.e., forming subpopulations with relatively 
homogeneous risks). To the extent that the risk factors involve 
personal habits or life style, they can be expected to vary 
substantially within CD populations. 

One illustration of the issue is the case of malignant 
cutaneous melanoma. Cumulative exposure to the sun is a risk 
factor in some skin cancers, and purely geographic factors such 
as altitude and climate may be directly relevant. However, 
overriding influences in the case of melanoma involve individual 
sensitivity and intermittency of exposure. Consequently, 
homogeneous risk strata could be more accurately represented in 
terms of occupation (indoor workers at greater risk than outdoor 


workers), sensitivity (fair skin, tendency to burn, etc.), and 
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socioeconomic status (recreational tanning, winter vacations, 
etc.), than solely in terms of geography (Elwood et al. (1985)). 

The characteristics of individuals, whose membership 
in one CD population rather than another is the outcome of 
arbitrary grouping, are a source of non-systematic variability 
among estimates of average risk. The grouping may correspond to a 
random space lonsssOdeec new e Canadian mmepOD bat] One lt Dele sence 
distribution of CD averages will reflect the distribution of 
risks of typical individuals in the national population. That is, 
ay —probability density sisPsanseappropriate idescription. of the 
distribution of CD averages, since the source of variability has 
a frequency interpretation. 
4. Linear Empirical Bayes Adjustment for Sampling Variability 

Linear empirical Bayes procedures (LEB) may be used to 
estimate vectors of random means. A simple form of LEB procedure 
can be represented as estimation of a random parameter vector @ 
by regression on a vector of observed data y (Robbins, 1983). The 
goal is to minimize E{ @(y)-@ }2, where @(y) is a linear function 
of y estimating ®. In the present application, LEB procedures 
involve shrinking each a toward the centre of the distribution. 
The magnitude of shrinkage is large, if the sampling variance of 
a given qs is large (i.e., providing an adjustment for sampling 
Variability) 

Shrinking estimates in the direction of their overall 
average provides a reasonable compromise between using the 


overall average to represent every CD (i.e., effectively assuming 
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that variance between CDs is due to sampling variability) or 
accepting each CD estimate without modification (i.e., 
effectively assuming that CDs represent unrelated populations). 

A variance stabilizing transformation is employed so 
that each q4 will have (asymptotic) sampling variance independent 
of q.3. Variance stabilization is given by Anscombe’s (1949) 
modification of the arcsin transformation: 

t; = 2 arcsin(,{ (Dj + 3/8)/(K; + 3/4) ). (3) 
Anscombe’s modification gives added stability for CDs with small 
K, but its effect is negligible for most CDs in this application. 
Similar use of an arcsin transformation applied to binomial data 
in an LEB context is found in Efron and Morris (1975). 


Variability among CD averages is represented in the 


model: 

t; a so + (t5 =.) ert uj = te. + ej, (4) 
where, ts = 2 arcsin( { q.5 ): 
Note, Var CCT mmo) NV vetin kK, where 1/Kj; represents binomial 


(sampling) variance of us given the arcsin transformation and v 
represents variance in excess of sampling variance (i.e., 
variance among CD deviates (t3-t.)). Further note that 
Cov (t;,t;)=v. 

Estimates of t. and v are obtained iteratively. The 
estimate t. employs the inverse of the estimated variance of each 


t, as a weight: 
t.= 5 (t; wmj ) /2 wa;, (5) 


18) 
where wmj = Li e( Vv + 1/K; ) with summation over m CDs. Note that 
att v were large enough the weights would be nearly equal (i.e., 
approximately independent of K). 
An estimate of v may take the same form as would a 
maximum likelihood estimate assuming normality with correction 


factor m/(m-1). A convenient computational form is: 


Vier = (m/(m-1)] & [ (tj-t) wy J? / S wy, (6) 
where WV ts LaiyA(Gik: ap 1/(¥4K5})], with vy a current estimate and 
Vi the previous estimate. Note, E(v) = v whether the deviates 


(t5-t) are normal or otherwise. Given a starting value for tus 
Eqn. 6 may be solved iteratively. Then t is estimated from Eqnivms 
with new weights, and the cycle is repeated to convergence. 

Table 1 presents selected results of the variance 
computations. The table shows coefficients of variation (CV) and 
percentages of variance in excess of sampling variance (i.e., 
termed % Excess) for each age group. These are summary measures 
which provide a comparison of standard non-sampling deviations to 
the average and excess variance relative to sampling variance, 
respectively. CVs range between 8% and 28%, indicating that 
deviations from the average are typically substantial relative to 
the average. With the exception of ages 0-1, the % Excess values 
arejlall woreatemmthangs/02 Wives oeindicataungmthaLt mostalotm the 
variability exhibited in the data is non-sampling variability). 
These results clearly indicate that there are important 


differences among the mortality levels exhibited in the CD data. 


AE, 
Correspondingly, the overall average is not a uniformly adequate 
indicator of risks experienced in the population. 

Standard LEB estimates of risk (q’) may be obtained 
from: 

Pajee2 arcsin({q’5) =t+ [Vv / (v + 1/Kj)] e;, (7) 
where the shrinkage factor applied to e4 estimates the ratio of 
Cov (t;,t) Lo var (t3) (1.e., analogous to a regression slope). 
Extending the regression analogy, residual variance var(t’4-t;) 
may be expressed as var (tj) - Cov (tj, tj) 2/var(ts) and is 
estimated by [v / (Vv + 1/K;)] (1/K}). 

A number of authors (e.g., Louis (1984) and Gaver and 
O’Muircheartaigh (1987)) have expressed concern that standard LEB 
estimators shrink too far. For example, the observation that the 
sample variance S(t's - t)2/m-1 is less than v implies that the 
empirical distribution function (EDF) of t’s will not approximate 
the EDF of t5- Louis has proposed a restricted LEB estimator 
which provides an approximation of the EDF of t4, that minimizes 
distance from the standard LEB estimates (i.e., minimizing 

& wes (t*4-t'5)2 ): 
t*5 = 2 arcsin({q*5) = [wej/(wej+d,)] t’3 + [d2/(wej+d))], (8) 
where wes is an estimate of 1/Var(t’5-t5). 

The parameters d, and d5 are lagrangian multipliers 
chosen so that the sample average emer & t*5/m and the sample 
variance @(t*;-t*.)?/m-1 match t and v, respectively. Values of 
d, for each sex and age group were obtained by a secant 


algorithm. At convergence approximate values of dj, gave sample 


LZ 
standard deviations that differed from v by less than 0.00001 
(i.e., less than 0.01% error in all cases). Parameter dj given dj 
was calculated by rearrangement of Eqn. 8 and provided sample 
averages that agreed with t exactly in each case. 

The restricted LEB estimates t*5 are reasonable 
estimates of risk quantiles, in as much as they are each close to 
the corresponding standard LEB estimate and agree closely with 
the estimated mean and variance of ce Thus, the t* 5 provide a 
basis for approximating probability density functions of 
mortality risk given adjustment for differences in the 
reliability of q5 
5. Risk Density Function Estimates 

Non-parametric estimation of a probability density 
function of mortality risk is intended to provide an indication 
of the characteristics of the distributions that would have to be 
accounted for in an appropriate parametric model. In that sense, 
this stage of analysis is exploratory. However, density 
estimation can also provide pseudo-likelihood location and scale 
estimates of mortality risk, and aid interpolation to equally 
spaced risk quantiles. 

The goodness of fit of a simple parametric reference 
model, ,of §t* amay, bessassessed ‘from /thes.linearity or Lack 90f 
linearity in the association between t* order statistics and 
expected reference quantiles. In the present application, the 
reference quantiles (denoted hereafter as z) are from a beta 


density function. The beta density has the form: 


3) 
£{(Z}§=mLe2ceom Clo eee eBeta(b,c) , (9) 
where Beta(b,c) is the beta function. Beta densities can 
accommodate a wide variety of distributional shapes and are 
suitable for modeling bounded random variables (t* is bounded by 
zero and ‘Wf ). The parameters b and c are chosen to match SPs and 
v/ ry : 

Silverman (1985) describes a smoothing cubic spline 
algorithm for nonparametric regression which will be used here to 
provide both smooth estimates of t* and smooth estimates of first 
derivatives dt*/dz. The algorithm minimizes the vector function: 
(1-h) [t*-t*(z)}? w c(t*-t*(z)] + oh (etm (zy2 dtm (zr (10) 
where t* is a vector of order statistics, Cadz  Vsmeamviector, of 
fitted values (from regression on 2), t*"(z) represents second 
derivatives of the fitted values, and W is a diagonal matrix of 
weights. The derivatives dt*(z)/dz estimate the ratio of the 
(known) probability of z to the probability of t*(z). Thus, non- 
linearities in the association between t* and z can _ be 
represented by an empirical density differing from the fitted 
beta density. 

The parameter h in Eqn. 10 is a smoothing parameter 
that differentially weights local accuracy and global smoothness. 
As h approaches 1.0, the algorithm represents the relation 
between t* and z as a location-scale transformation (i.e., 
linearity implies t*"(z)=0). As h approaches 0.0, the algorithm 
implicitly includes as many parameters as there are observations 


(i.e., providing an interpolation where ¢*-t*(z)=0). An 
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appropriate value of h can be obtained bY eMminimizingmecLosse 
validation errors (Silverman, 1984). 

As the fitted values are estimated quantiles, W is 
available from the inverse of estimates of t*(z) variances: 

Var{t*(z)} = s* [ (p(1-p)) / (m £{t*(z))?) 1, (11) 

where p is an appropriate EDF estimate, m is the number of 
observations, and s* is a scalar (Parzen, 1979). The probability 
ordinates f{t*(z)} are obtained from the probability ordinates of 
z Givided by the corresponding estimate of dt*(z)/dz. Thus, the 
spline smoothing algorithm can be iterated, updating W each time 
from improved estimates of f({t*(z)} and s*. Minimization of 
cross-validation errors and iterative use of Eqn. 11 led to 
values of h (for each age group) that were in the range 0.76 to 
GES hS)h, 

Details of similar use of order statistics and model 
quantiles for nonparametric density estimation are found in 
Bofinger (1975) and Parzen (1979). The use of spline smoothing is 
justified by the interpretation of the spline as a moving average 
(kernel) smoother with a local bandwidth proportionate to the 
local variance (Silverman, 1984). 

The estimates az) are subsequently employed to 
construct life tables. For these purposes, qe is obtained by 
inverting the arcsin transformation (i.e., q*=[sin( t*(z)/2 )]? ) 
and estimates of the probability density function of q* are 
obtained by applying the Jacobian of the inverse transform to 


Ciel ae 


+5 
6. Illustrative Results 

In this section, intermediate results are presented 
which illustrate the analysis. Figure 1 provides a plot of q 
versus q* for males in age groups 0-1 and 50-54. These age groups 
were chosen because the distributions were well separated, but 
not so much as to completely obscure detail (i.e., by reducing 
CieselesOluctonmrot stne =plot). =COrresponding= plots —oLmempirical 
density ratios are presented in Figure 2. The density ratios, 
estimated from the smooth first derivatives obtained above, 
emphasize differences that may exist between empirical mortality 
risk density functions and transformations of the fitted beta 
density functions. In that sense, the density ratios aid an 
evaluation of goodness of fit. Note that the plotted density 
ratios have been adjusted for the constant scale factor’. 

A general index of the effects of shrinking and 
smoothing can be provided by the slope of a resistant line 
Prec ing qs as a function of an The slope for males aged 0-1 is 
1.33 which indicates that the dispersion of unadjusted mortality 
riskses sc setuyprcally= Soe" -qreater™ “than the adjusted’ +The 
corresponding slope for males aged 50-54 is also 1.08. The 
maximum slope for males is 1.33 (ages 0-1), while the maximum for 
females is 1.21 (ages 35-39). The corresponding minimum values 
are 1.03 for males aged 20-24, and 1.06 for females aged 65-69. 
The effect of the adjustments on the dispersion of mortality risk 


estimates is not large, an observation which is not consistent 


16 
with the assumption that small area mortality estimates are 
generally unstable. 

Piguressleeiliustrates ithe errects sotemsnr nk ing sand 
smoothing on the q5- In age group 0-1, the upper tail (including 
seven or more possible outliers) has been drawn in substantially. 
Note that this age group has the smallest proportion of non- 
Sampling variance of all age groups, as indicated by the @« Excess 
values in Table 1. Consequently, this age group is more strongly 
influenced by shrinking than other age groups, as indicated by 
the resistent line slope above. 

Figure 2 provides an indication of the qualitative 
differences there might be between the true distribution of 
mortality risks and the beta density that was chosen as a 
reference distribution. For example, density ratios for males 
aged) 50-54" "at generally “close toul.0. Thus, tor= this age group. 
the beta density may provide an adequate probability model. 
Perhaps more to the point, a two parameter (i.e., mean and 
variance) model appears adequate to describe the distribution of 
mortality risks for this age group. By contrast, density ratios 
for ages 0-1 suggest mortality risks among infants that are less 
skewed and have shorter tales than would be represented in a beta 
density. This is implied by the shape of the density ratio 
function, that is by the peak in the centre (values > 1.0) and 
low values (values < 1.0) in the tails. Density ratios similar to 
males aged 0-1 (i.e., indicating a lack of fit) are more common 


among other age groups than are density ratios similar to males 
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age 50-54. This would imply that three or four parameters may be 
required to approximate the distribution of mortality risks for 
most age groups. 

The results presented here are sensitive to the 
shrinkage parameters in Eqn. 8 (i.e., d, and dj) and to the 
choice of the smoothing parameter in Eqn.*10' (i.e., h). Thus, 
while the results may suggest that a four parameter model would 
be more appropriate tha a two parameter model, they do not 
provide a basis for significance tests (e.g., likelihood ratio 
tests). 

7. Quantile Life Tables 

Period life tables represent mortality conditions in 
terms of completed synthetic life times (e.g., forming survival 
curves from the product integral of age specific survival 
probabilities eqn Quantile life tables are a generalization 
of period life tables constructed on sections through the 
smoothed age specific mortality distributions of ote These 
sections link corresponding quantiles of different age groups. 
The resulting survival curves can be used to assess variability 
in durationsiof life y(i:e.jelife..expectancy”#atbirth;a'or the 
median age at death). 

Table 2 provides a comparison of previously published 
mortality probabilities (Nagnur, 1986 - termed AGGREGATE 
estimates in Table 2) with estimates employing the smoothed CD 
mortality probabilities and the empirical density functions. This 


comparison is intended to validate the life table results 
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presented here by demonstrating that the estimates are 
comparable in magnitude to more conventional estimates. Moreover, 
for most age groups, the order of MEAN, MEDIAN, and MODAL values 
clearly indicates positive skewness in the risk distributions. 

However, none of the estimates in Table 2 are strictly 
comparable as estimates of the same measure of location. In 
particular, the AGGREGATE and MEAN estimates differ in as much as 
the former may be represented as a weighted average of qs with K; 
as weights and the latter is related to a weighted average with 
the reciprocals of empirical variances as weights. A weighting 
scheme similar to that employed in Eqn. 5 could result in 
markedly smaller differentials among weights compared with 
differentials among population sizes. Thus, the observation that 
estimates employing CD data are frequently higher than the more 
conventional AGGREGATE estimates might imply a bias in the 
AGGREGATE estimates. Because of the differences in these 
particular estimation procedures, the issue of which estimator 
may have the greater bias is too complex to resolve here. 
Nevertheless, weighting data by population size may produce 
inappropriate results. 

Table 3 provides estimates of average and median 
duration of life from selected quantile life tables. These 
suggest that the variability in risks experienced within the 
Canadian population is consistent with a wide range of durations 
of life. Partial verification of these results is provided by 


comparing the low tail estimates to a group within the Canadian 


dice] 

population that is known to have had high mortality in the same 
time period. The expectation of life in the Registered Indian 
population in 1976 (representing about 1% of the Canadian 
population) was 60 years for males and 66 years for females (Rowe 
and Norris, 1985). Estimates based on quantile life tables are 
less extreme than the estimates for the Registered Indian 
population, which may indicate that the quantile estimates are 
conservative. 

Table 3 provides a perspective on recent Canadian 
mortality trends. Over the past two decades, life expectancy in 
Canada has been increasing by at least one year over each five 
year time interval. Such increases are not large in comparison to 
ches Guantties ranges In Tablee™3s, PAs™such, 1c is worth Vasking 
whether these trends result from general improvements in health 
or from improvements that are specific to a part of the 
population. That is, would risk distributions from earlier time 
periods be shifted upwards or would their upper tails be 
stretched relative to the 1976 distributions? The distinction has 
bearing on our expectations for future trends in mortality. 

Quantile life tables serve to focus attention on two 
elements of the ‘description of mortality conditions in 
heterogeneous populations. That is, on the need to allow for 
differences among individuals of the same age, and for 
differences that might exist in the same group of individuals at 
different ages. Survival curves may be constructed by specifying 


both a distribution of risk at each age and specifying a path 
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through the succession of risk distributions. Presumably, smokers 
who stop smoking follow a different path than smokers who do not. 
Quantities difegstables: mrenpresentmmpatismmtnvoughmeC comm ion 
distributions that are in a sense determined at birth. 
8. Discussion 

The key findings of this study might be summarized as 
follows: 

(a) Most of the variability among small area, age specific 
mortality probabilities reflects variability in mortality risks, 
rather than instability in small area estimates. 

(b) Differences among CD averages are large enough to be of 
importance for descriptive or policy purposes, even after 
adjustment for sampling variance and after smoothing. For 
example, recent national mortality trends represented by changes 
in life expectancy might reflect contraction in the tails of the 
risk distributions, rather than shifts at the centre. 

(c) Extreme quantile life tables provide a direct 
description of the range of mortality risk in a heterogeneous 
population by representing the best and worst synthetic life 
times that are consistent with the data. Quantile life tables 
also provide estimates of dispersion near the centre of mortality 
risk distributions. 

The interpretation of these results depends on the 
assumption that CD populations could be representative samples of 
individuals with independent risks. This view is very different 


from one which explicitly or implicitly assumes region specific 


Pash 
effects. However, it is also a view which more nearly accords 
with mortality conditions in which contagious disease plays a 
relatively minor role while life style plays a major one. 

These results might provide encouragement for further 
analysis of finely partitioned mortality data. Extensions might 
make use of partioning variables that can not be viewed as non- 
systematic (e.g., marital status), in addition to use of finer 
geographic resolution. But, for the present, replication of this 
work over a number of time periods might be the most promising 
extension. 
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TABLE 1 
Summary Measures of Variability 


in Census Division Mortality, Risk 
(Arcsin Transforms ts of q3) 


MALES FEMALES 

AGE Coefficient % Excess Coefficient % Excess 
GROUP of Variation of Variation 

Gee 14.7 ehh, 2 16.3 Oe 

1-4 28.4 S2e 20. 2 74.6 

eit, 24.8 CER 206. oO ah sit 
10-14 24,57 CS al PAT | 7 O10 
Lo Lo 23% 0 S13 25.0 Sin 
20-24 25.4 S210 aie. 3 Sie.0 
2o-29 215.4 S657 Zioe.O dL Ovaud 
30=—34 20no 84.0 24.0 1629 
Bo o Jey at Siz st 20% 2 7320 
40-44 16.4 Sits Oro Viere 
45-49 Send S35 ive. 6 SO a7 
50-54 2M, sha) 84.2 14.3 thse 
Sao lle. 3 Siang ste 2 peat line, 92) 
60-64 92-6 87.8 PA pe Sent 
5269 oes tf 36.7 Len 84.9 
Or) a oF So 3 Seat S026 
(heme fhe) on rattegies TO Ore 7, 
80-84 3 AP 84.6 ie Fergal 88.0 
sistas he) Oe 2 3510 Se et Wee eee) 
Coefficient of variation = rooney e 

% Excess - Variance in Excess of Sampling Variance as a Percent 

of Total Variance = 100 (m v)/ & (v + 1/K3) 
t - estimated average, v - Non-sampling variance, 


Gis population size, m - 259 Census Divisions 


AGE 
MALES 
Oma 
t=4 
o-9 
10-14 
L5=19 
20-24 
25-29 
30-34 
3D=39 
40-44 
45-49 
50-54 
ooo 
60-64 
G5—69 
70-74 
TST hs) 
80-84 
85-89 
FEMALES 

O=1 
1-4 
SFE) 
10-14 
b= 19 
20-24 
20-29 
30-34 
She bath 
40-44 
45-49 
50-54 
Soo 
60-64 
65-69 
10/4 
Pf ene fhe} 
80-84 
85=89 
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Table 2 


Comparison among Location Estimates 


for Mortality Probabilities q 


AGGREGATE 


0.0141 
O70032 
0.0023 
0.0021 
020075 
Omu0eS 
0.0074 
0.0078 
0.0107 
0.0164 
O2027'3 
0.0439 
0.0693 
0.1053 
One 5 7, 
OFZ 251 
0.3234 
0.4508 
OF D939 


0.0114 
0.0025 
0.0015 
0.0013 
0.0026 
0.0027 
0.0028 
0.0038 
0.0059 
0.0091 
0.0152 
0.0218 
O20Se9 
07. 0512 
0.0800 
On tZ2 67 
0.2010 
Ono gee 
0.4866 


MEAN 


0.0149 
0.0042 
0.0033 
0.0030 
0.0092 
Om 011s) 
0.0092 
0.0098 
0.0123 
0.0183 
0.0284 
0.0442 
0.0694 
0.1050 
OS DO 
0.2224 
Or 
0.4467 
0.6214 


OF.0123 
0.0035 
0.0024 
0.0020 
0.0036 
0.0036 
0.0036 
0.0047 
0.0072 
0.0103 
0.0151 
On O220 
0.0342 
Of 0513 
OF. 0725 
0.1282 
0.2037 
0.3318 
0.4965 


MEDIAN 


0.0146 
0.0039 
0.0032 
0.0028 
0.0087 
0.0111 
0.0088 
0.0095 
0.0118 
0.0178 
0.0279 
0.0437 
0.0687 
0.1054 
0.1499 
Om2219 
OFS 165 
0.4453 
0.6207 


0.0120 
0.0033 
0.0023 
0.0018 
0.0034 
0.0034 
0.0034 
0.0044 
0.0070 
0.0101 
0.0147 
0.0225 
0.0338 
0.0511 
0.0794 
Om T2799 
0.2024 
0.3324 
0.4919 


AGGREGATE - as published in Nagnur (1986) 
of total deaths to total population. 


MEAN, MEDIAN, MODAL - as determined by empirical Bayes estimates 
of mortality risk and empirical probability densities. 


MODAL 


Or 02.43 
0.0034 
0.0029 
OTO0LS 
OR DOF 
0.0099 
0.0081 
0.0089 
0755 0 Ks fe) 
0.0169 
0.0270 
0.0428 
050675 
0.1068 
0.1486 
0.2224 
0.3174 
0.4459 
OF Oro 


Or OD15 
0.0032 
0.0020 
0.0016 
0.0030 
0.0030 
0.0028 
020089 
O00 6H 
0.0097 
0.0144 
0.0220 
Om03)5,0 
0.0515 
0.0804 
OMe o7 
0.2010 
On 3.37 
0.4817 


based on the ratios 
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TABLE 3 


Quantile Estimates of Duration of Life 


Quantile Life Expectancy Median Age 
at Birth at Death 
MALES FEMALES MALES FEMALES 
1% 63.4 71.4 6829 it O 
5% 65.0 aCe: Tit) ea) 78.4 
10% 66775 74.3 UAL tps) ep 
25% 63.2 Oren FPA as 80.2 
50% spe hens! I ihoak CARS: ems Ae 
75% 71.4 Pigs ire 512 ea 
90% 1200 TiSbst! 76.4 Ban 0 
95% UShe 80.5 TEE elt 83.6 
99% thay S| 82.0 78.4 84.8 
MEAN 697.6 THE) Hele 61.0 
MODAL AO ete Wile & 74.0 Slee 
AGGREGATE 2.0. 2.0 Plgend O TSheas! lees 


MEAN, MODAL estimates obtained from mortality probabilities in 
Table 2. 

AGGREGATE - life expectancy as published in Nagnur (1986)- 
median ages at death interpolated from the published 
survivor curve. 
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